618 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
CC-BY
Size:
26200 sentences Production Status:
Newly created-finished
Use:
Named Entity Recognition
-
Paper title:NoSta-D Named Entity Annotation for German: Guidelines and Dataset
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Darina Benikova | FG Language Technology, TU Darmstadt | DE | ||
| Author 2 | Chris Biemann | TU Darmstadt | DE | University of Hamburg | DE |
| Author 3 | Marc Reznicek | Universidad Complutense de Madrid | ES | ||
| Main Contact | Chris Biemann | University of Hamburg | None |
Documentation:
yes, extensive guidelines and metadata
Written
Lexicon,
Language Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:Separating Brands from Types: an Investigation of Different Features for the Food Domain
-
Paper track:IE/database linking
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||||
|---|---|---|---|---|---|---|---|
| Author 1 | Michael Wiegand | Saarland University | DE | ||||
| Author 2 | Dietrich Klakow | Saarland University | None | Universität des Saarlandes | None | Saarland University, Spoken Language Systems | None |
| Main Contact | Michael Wiegand | Saarland University | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
German
Availability:
From Owner
License:
free for research and educational purposes
Size:
<Not Specified> <Not Specified>Production Status:
Newly created-finished
Use:
converts constituent trees to dependency trees
-
Paper title:Making Ellipses Explicit in Dependency Conversion for a German Treebank
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Wolfgang Seeker | Universität Stuttgart, IMS | None |
| Author 2 | Jonas Kuhn | Universität Stuttgart, IMS | None |
| Main Contact | Wolfgang Seeker | University of Stuttgart | DE |
Documentation:
<Not Specified>Language Type:
Trilingual
Languages:
English German french
Availability:
Freely Available
License:
<Not Specified>
Size:
16 hours Production Status:
Newly created-in progress
Use:
Dialogue
-
Paper title:The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues
-
Paper track:Speech
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Volha Petukhova | Saarland University | DE |
| Author 10 | Olivier Deroo | Acapela Group S.A. | BE |
| Author 11 | Ronny Egeler | Sikom Software GmbH | DE |
| Author 12 | Uwe Meinz | Sikom Software GmbH | DE |
| Author 13 | Steffen Liersch | Sikom Software GmbH | DE |
| Author 14 | Anna Schmidt | Saarland Univeristy | DE |
| Author 2 | Martin Gropp | Saarland University | DE |
| Author 3 | Dietrich Klakow | Saarland University | DE |
| Author 4 | Gregor Eigner | Mipumi Games GmbH | AT |
| Author 5 | Mario Topf | Mipumi Games GmbH | AT |
| Author 6 | Stefan Srb | Mipumi Games GmbH | AT |
| Author 7 | Petr Motlicek | Idiap Research Institute | CH |
| Author 8 | Blaise Potard | Idiap Research Institute | CH |
| Author 9 | John Dines | KOEMEI SA | CH |
| Main Contact | Volha Petukhova | Saarland University | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
German
Availability:
Not Available
License:
<Not Specified>
Size:
1.08 entries Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Towards an Automatic Assessment of Crowdsourced Data for NLU
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Patricia Braunger | Daimler AG | DE |
| Author 2 | Wolfgang Maier | Daimler AG | DE |
| Author 3 | Jan Wessling | Daimler AG | DE |
| Author 4 | Maria Schmidt | Daimler AG | DE |
| Main Contact | Patricia Braunger | Daimler AG | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
German Swiss German
Availability:
Not Available
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
-
Paper title:Building a Corpus from Handwritten Picture Postcards: Transcription, Annotation and Part-of-Speech Tagging
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Kyoko Sugisaki | University of Zurich | CH |
| Author 2 | Nicolas Wiedmer | University of Zurich | CH |
| Author 3 | Heiko Hausendorf | University of Zurich | CH |
| Main Contact | Kyoko Sugisaki | University of Zurich | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
CreativeCommons
Size:
124116 tokens Production Status:
Newly created-finished
Use:
Machine Learning
-
Paper title:TGermaCorp -- A (Digital) Humanities Resource for (Computational) Linguistics
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Andy Luecking | Goethe University Frankfurt | DE |
| Author 2 | Armin Hoenen | Text Technology Lab Goethe University Frankfurt am Main | DE |
| Author 3 | Alexander Mehler | Goethe-University Frankfurt am Main | DE |
| Main Contact | Andy Luecking | Goethe University Frankfurt | None |
Documentation:
<Not Specified>
Written
Grammar/Language Model,
Language Type:
Multilingual
Languages:
German
Availability:
freely available, but subject to requester having obtained the license for the original TIGER treebank
License:
terms and conditions for original TIGER treebank apply
Size:
<Not Specified> <Not Specified>Production Status:
Newly created-in progress
Use:
Language Modelling
-
Paper title:German and English Treebanks and Lexica for Tree-Adjoining Grammars
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Miriam Kaeshammer | University of Düsseldorf, Institut für Sprache und Information | None |
| Author 2 | Vera Demberg | Saarland University, Department of Computational Linguistics | None |
| Main Contact | Miriam Kaeshammer | Heinrich-Heine-Universität Düsseldorf | DE |
Documentation:
'in ''German and English Treebanks and Lexica for Tree-Adjoining Grammars, Kaeshammer and Demberg (to appear)'' and with the grammar itself'Language Type:
Trilingual
Languages:
English German italian
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
<Not Specified>
-
Paper title:KRAUTS: A German Temporally Annotated News Corpus
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Jannik Strötgen | Heidelberg University | DE | Max-Planck-Institut für Informatik | DE |
| Author 2 | Anne-Lyse Minard | IRISA, CNRS | FR | ||
| Author 3 | Lukas Lange | Max-Planck-Institut für Informatik | DE | ||
| Author 4 | Manuela Speranza | FBK | IT | ||
| Author 5 | Bernardo Magnini | FBK | IT | ||
| Main Contact | Jannik Strötgen | Max-Planck-Institut für Informatik | None | Bosch Center for Artificial Intelligence | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
MIT
Size:
17 MByte Production Status:
Newly created-finished
Use:
Opinion Mining/Sentiment Analysis
-
Paper title:PotTS: The Potsdam Twitter Sentiment Corpus
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Uladzimir Sidarenka | University of Potsdam | DE |
| Main Contact | Uladzimir Sidarenka | University of Potsdam | None |
Documentation:
annotation guidelines (in English) are provided along with the dataset (cf. corpus/doc)




